Algebraic Distribution of Segmental Duplication Lengths in Whole-Genome Sequence Self-Alignments
نویسندگان
چکیده
Distributions of duplicated sequences from genome self-alignment are characterized, including forward and backward alignments in bacteria and eukaryotes. A Markovian process without auto-correlation should generate an exponential distribution expected from local effects of point mutation and selection on localised function; however, the observed distributions show substantial deviation from exponential form--they are roughly algebraic instead--suggesting a novel kind of long-distance correlation that must be non-local in origin.
منابع مشابه
Relationship between Chromosome Rearrangement and Repeat Sequences in Human Chromosome 7
A various types of repeat patterns are abundant in genomic sequence, and are associated with the biological phenomena at distinct levels. In particular, comparative analyses of whole-genome-sized sequence data reveal that the periodic sequences cause the segmental duplication that is a type of chromosomal structural arrangement [2]. In this study, we analyze the relationships between the large-...
متن کاملShort Segmental Duplication: Parsimony in the Growth of Microbial Genomes
∗ We show that textual analysis of microbial complete genomes reveals telling footprints of their early evolution. If a DNA sequence considered as a text in its four bases is sufficiently random, the distribution of frequencies of words of a fixed length from the text should be Poissonian. We point out that in reality, for words less than nine letters complete microbial genomes universally have...
متن کاملShort Segmental Duplication: Model for Growth of Microbial Genomes
We show that textual analysis of microbial complete genomes reveals telling footprints of their early evolution. If a DNA sequence considered as a text in its four bases is sufficiently random, the distribution of frequencies of words of a fixed length from the text should be Poissonian. We point out that in reality, for words less than nine letters complete microbial genomes universally have d...
متن کاملGrowth of microbial genomes by short segmental duplications
A DNA sequence can be analyzed as a text of four letters by counting the times each word in the set of k-letter words occurs in the text. If the text is random and long enough, then the frequencies of word occurrence are expected to obey a Poisson distribution. Examination of complete microbial genomes shows that for k less than 9, the distribution has a width many times the width of a Poisson ...
متن کاملprogressiveMauve: Multiple Genome Alignment with Gene Gain, Loss and Rearrangement
BACKGROUND Multiple genome alignment remains a challenging problem. Effects of recombination including rearrangement, segmental duplication, gain, and loss can create a mosaic pattern of homology even among closely related organisms. METHODOLOGY/PRINCIPAL FINDINGS We describe a new method to align two or more genomes that have undergone rearrangements due to recombination and substantial amou...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 6 شماره
صفحات -
تاریخ انتشار 2011